Abstract

We obtain an approximately Gaussian distribution from a Poisson distribution by a change of variable. From it we derive a new chi-square function that can be used for parameter estimation and goodness-of-fit testing when fitting curves to histograms. Since the new distribution is approximately Gaussian, it can be used even when the bin contents are small. This chi-square function is simple to implement and shows a fast convergence of the parameters to the correct value, especially for the parameters associated with the width of the fitted curve. We present a comparative Monte Carlo study of the fitting method introduced here and two other methods for three types of curves: Gaussian, Breit-Wigner and Moyal, when each bin content obeys a Poisson distribution. It is also shown that the new method and the other two converge to the same result as the number of events increases.
1 Introduction



Nowadays there is an intensive experimental effort to search for new phenomena in many fields of particle physics. These signals should be rare events, for which one has to treat low-statistics data and non-Gaussian errors. Such data require special care, since the usual Gaussian procedures and techniques are no longer valid. On the other hand, the minimization of chi-square functions is normally simple, relatively fast and a very familiar way of fitting data, of estimating parameters and their errors, and of testing goodness of fit. The usual method of fitting a theoretical curve to experimental data in order to estimate some parameter values is to minimize the function

$$\chi^2_G = \sum_{i=1}^{N} \frac{\left(n_i - f(x_i,\vec{a})\right)^2}{\sigma_i^2} \tag{1}$$

where $f(x_i,\vec{a})$ is a known function (model prediction) calculated at the point $x_i$ and $\vec{a}$ is a vector of the parameters one wants to obtain; $n_i$ is the measured experimental value associated with the bin located at $x_i$ and $N$ is the number of bins in the range of interest. This method has been widely used but has some limitations, such as the assumption that each bin content obeys a Gaussian distribution of spread $\sigma_i$, or that the measurement errors are at least almost "Gaussian". If the content of each bin obeys a Poisson distribution, some authors [1] recommend that each bin contain at least a statistically significant number of entries $n_i$, such that $\sigma_i \approx \sqrt{n_i}$, where the asymptotic limit for a large number of measurements starts and the square root of the variance can be considered a good interval estimate. For smaller values, there are suggestions to use bins of variable size such that each bin content exceeds this statistically significant number of entries. The disadvantage of these suggestions is that one can lose important information about the structure of the studied distribution. There is no rule of thumb for the bin width or for the ideal number of bins into which the region of interest should be divided. There is yet another hint that the ideal case is to use bins of variable width with equal probability contents [2].

In a very interesting paper, Baker and Cousins [3] call attention to and discuss topics such as point estimation, confidence interval estimation, goodness-of-fit testing, biased estimation, etc., when fitting curves to histograms using a chi-square function.

They presented a $\chi^2_{BC}$ function for fitting histograms when the bin contents obey a Poisson distribution. They defined a Poisson likelihood chi-square, which is given by Eq. (2) below.

$$\chi^2_{BC} = 2\sum_{i=1}^{N}\left[f(x_i,\vec{a}) - n_i + n_i \log\left(\frac{n_i}{f(x_i,\vec{a})}\right)\right] \tag{2}$$

This function behaves asymptotically as a chi-square distribution and can then be used for estimation and goodness-of-fit testing.
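As an illustration, the two reference functions can be written in a few lines of Python. This is only a sketch: it assumes the common choice $\sigma_i^2 = n_i$ in the least-squares function $\sum_i (n_i - f_i)^2/\sigma_i^2$ (so empty bins must be skipped), and it treats the term $n_i \log(n_i/f_i)$ of the Poisson likelihood chi-square $2\sum_i [f_i - n_i + n_i\log(n_i/f_i)]$ as zero when $n_i = 0$. The function names and the sample numbers are hypothetical.

```python
import math

def chi2_gauss(n, f):
    """Least-squares chi-square, assuming sigma_i^2 = n_i; empty bins are skipped."""
    return sum((ni - fi) ** 2 / ni for ni, fi in zip(n, f) if ni > 0)

def chi2_baker_cousins(n, f):
    """Poisson likelihood chi-square: 2 * sum[f_i - n_i + n_i log(n_i / f_i)]."""
    total = 0.0
    for ni, fi in zip(n, f):
        total += fi - ni
        if ni > 0:  # the logarithmic term vanishes for an empty bin
            total += ni * math.log(ni / fi)
    return 2.0 * total

# Hypothetical bin contents and model predictions, for illustration only.
counts = [0, 3, 7, 4, 1]
model = [0.5, 2.5, 6.0, 4.5, 1.5]
print(chi2_gauss(counts, model), chi2_baker_cousins(counts, model))
```

Note that the empty first bin contributes nothing to the least-squares value but still contributes $2 f_i$ to the Poisson likelihood chi-square.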



The chi-square function presented in this paper also behaves asymptotically like the classical chi-square function, converges fast to the correct value, fluctuates much less than $\chi^2_G$, and also works for distributions with long tails, where bin contents can be very low. It is easy to implement in any minimization program.

After this introduction, we demonstrate in section 2 how one can transform a non-Gaussian pdf into an approximately Gaussian pdf. Section 3 is devoted to obtaining the chi-square function for the approximately Gaussian pdf derived from a Poisson distribution and discusses some of its characteristics. In section 4 we compare the results obtained using Eq. (1), Eq. (2) and the new chi-square function for different numbers of entries in the histograms. This comparison
is made using Monte Carlo events generated according to some distributions with 
known parameters, as suggested by and in section 5 we present the conclusions. 
In the appendix we obtain the equivalent chi-square expression for a binomial 
distribution. 

2 Obtaining an approximate Gaussian distribution

The basic motivation is to transform, via a change of variable, a non-Gaussian probability density function (pdf) into an approximately Gaussian pdf, preserving the probability even when one has a small number of events. Let us consider a non-Gaussian pdf $p(x)$; one wants to obtain a transformation $z(x)$ such that the probability is preserved, Eq. (3), and that the new pdf $q(z)$ is approximately Gaussian:

$$p(x)\,dx = q(z)\,dz \tag{3}$$

Then $q(z)$ can be written as

$$q(z) = p(x(z))\left|\frac{dx}{dz}\right| \tag{4}$$

When a pdf is unimodal and obeys some regularity conditions, its logarithm is approximately quadratic, so that

$$\log(p(x)) \approx \log(p(\hat{x})) + \frac{1}{2}\left(\frac{d^2\log(p(x))}{dx^2}\right)_{\hat{x}}(x-\hat{x})^2 \tag{5}$$

where $\hat{x}$ is the point associated with the maximum of $p(x)$, and one can define the following quantity

$$J(\hat{x}) = -\left(\frac{d^2\log(p(x))}{dx^2}\right)_{\hat{x}} \tag{6}$$

On the other hand, the logarithm of a Normal pdf $g(x)$, with mean $\mu$ and standard deviation $\sigma$, is of the form

$$\log(g(x)) = \mathrm{const} - \frac{1}{2\sigma^2}(x-\mu)^2 \tag{7}$$

so that, given the location parameter $\mu$, it is completely determined by its standard deviation $\sigma$.

A comparison between Eq. (5) and Eq. (7) shows that the variance of the pdf $p(x)$ is approximately equal to $J^{-1}(\hat{x})$. Let us suppose now that $z(x)$ is a one-to-one transformation between $x$ and $z$; then, using the chain rule for derivatives, one gets the relation

$$J(z) = -\left(\frac{d^2\log(p(x))}{dx^2}\right)\left(\frac{dx}{dz}\right)^2 = J(x)\left(\frac{dx}{dz}\right)^2 \tag{8}$$

Let us choose $z(x)$ such that

$$J(z) = J(x)\left(\frac{dx}{dz}\right)^2 = 1 \tag{9}$$

This choice is made so as to make $J(z)$ independent of $z$, the standard deviation equal to one and the new distribution $q(z)$ approximately translation invariant along the $z$ axis. Thus the metric can be obtained from the relationship obeying the above conditions

$$\frac{dz}{dx} = J^{1/2}(x) \tag{10}$$

$$z = \int^{x} J^{1/2}(t)\,dt \tag{11}$$
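The integral $z(x) = \int^x J^{1/2}(t)\,dt$ can also be evaluated numerically when no closed form is at hand. The following Python sketch uses a plain midpoint rule; the helper name `z_of_x`, the lower limit 0 and the number of steps are assumptions made for the illustration. It is checked on $J^{1/2}(t) = 1/\sqrt{t}$, the Poisson case treated in section 3, for which the integral from 0 is $2\sqrt{x}$.

```python
import math

def z_of_x(sqrt_j, x, lower=0.0, steps=100_000):
    """Midpoint-rule estimate of z(x) = integral from `lower` to x of sqrt_j(t) dt."""
    h = (x - lower) / steps
    # Midpoints never touch t = lower, so an integrable singularity there is tolerated,
    # at the price of slow convergence.
    return h * sum(sqrt_j(lower + (i + 0.5) * h) for i in range(steps))

def poisson_sqrt_j(t):
    """J^(1/2)(t) = 1/sqrt(t), the Poisson case."""
    return 1.0 / math.sqrt(t)

z_numeric = z_of_x(poisson_sqrt_j, 4.0)
z_exact = 2.0 * math.sqrt(4.0)
print(z_numeric, z_exact)
```

The two printed values agree only to a few parts in a thousand, because the integrand diverges at the lower limit; a smoother $J^{1/2}$ would converge much faster.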

3 The Chi-square Function 

As an example, let us apply the above prescription to a Poisson pdf given by

$$p_P(x) = \frac{x^{k}e^{-x}}{\Gamma(k+1)} \tag{12}$$

which means that after observing $k$ events one has a pdf of the estimated mean parameter $x$. Now one wants to find a Gaussian-like pdf through a transformation of $x$. The location of the maximum of $p_P(x)$ is easily shown to be at $\hat{x} = k$, and the term associated with the second derivative is

$$\frac{d^2\log(p_P(x))}{dx^2} = -\frac{k}{x^2} \tag{13}$$

Using the fact that $\hat{x} = k$, one obtains

$$J(x) = \frac{1}{x} \tag{14}$$

then

$$J^{1/2}(x) = \frac{1}{\sqrt{x}} \tag{15}$$

Using Eqs. (10) and (11), the one-to-one transformation is obtained as

$$\frac{dz}{dx} = \frac{1}{\sqrt{x}} \tag{16}$$

$$z = \int_0^{x}\frac{dt}{\sqrt{t}} \tag{17}$$

$$z = 2\sqrt{x} \tag{18}$$

and the inverse transformation is

$$x = \left(\frac{z}{2}\right)^2 \tag{19}$$

Reusing the above expression in Eq. (12) and using Eq. (4), one obtains an approximately Gaussian expression $q_P(z)$ associated with the Poisson pdf $p_P(x)$, which is

$$q_P(z) = \frac{(z/2)^{2k+1}\,e^{-z^2/4}}{\Gamma(k+1)} \tag{20}$$

It is not difficult to show that the above expression is normalized, has $J(z) = 1$ and is translation invariant by construction. The approximate Gaussian and exact Gaussian distributions, $q_P(z)$ and $g(z)$, respectively, are shown in Fig. 1 for different values of $k$ ($k$ = 0, 1, 2, 5 and 10). The worst approximation occurs at $k = 0$, but it gets better very fast as $k$ increases.

This pdf $q_P(z)$ has a maximum at $\hat{z} = \sqrt{4k+2}$, which corresponds to $x_z = k + 1/2$. It is interesting to note that $x_z$ lies between $\hat{x} = k$ and the median of the Poisson pdf, $\approx k + 2/3$, i.e., the maximum of $p_P(x)$ is not directly related to the maximum of $q_P(z)$ via Eq. (18) and Eq. (19).
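The stated properties of the transformed pdf can be checked numerically. Substituting $x = (z/2)^2$ into the Poisson pdf $x^k e^{-x}/\Gamma(k+1)$ and multiplying by $dx/dz = z/2$ gives $(z/2)^{2k+1} e^{-z^2/4}/\Gamma(k+1)$; the Python sketch below takes this form as its starting point and verifies, for the values of $k$ quoted in the text, that it integrates to one and peaks at $\sqrt{4k+2}$. The helper names, the integration range and the step count are assumptions of the illustration.

```python
import math

def q_poisson(z, k):
    """Transformed Poisson pdf: (z/2)^(2k+1) * exp(-z^2/4) / Gamma(k+1), for z >= 0."""
    return (z / 2.0) ** (2 * k + 1) * math.exp(-z * z / 4.0) / math.gamma(k + 1)

def integrate(func, a, b, steps=20_000):
    """Simple midpoint rule on [a, b]."""
    h = (b - a) / steps
    return h * sum(func(a + (i + 0.5) * h) for i in range(steps))

for k in (0, 1, 2, 5, 10):
    norm = integrate(lambda z: q_poisson(z, k), 0.0, 40.0)
    z_hat = math.sqrt(4 * k + 2)
    # The density at z_hat should exceed the density slightly to either side.
    is_peak = (q_poisson(z_hat, k) > q_poisson(z_hat - 0.05, k)
               and q_poisson(z_hat, k) > q_poisson(z_hat + 0.05, k))
    print(k, round(norm, 6), round(z_hat, 6), is_peak)
```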
It is also not difficult to obtain an analytical expression for the confidence interval at roughly the 68.3% confidence level: since the obtained $q_P(z)$ has a maximum at $\hat{z} = \sqrt{4k+2}$ and a standard deviation equal to unity, one gets

$$[z_{min}, z_{max}] = \left[\sqrt{4k+2}-1,\ \sqrt{4k+2}+1\right] \tag{21}$$

Taking the inverse transformation, one gets the corresponding interval associated with the original Poisson pdf

$$[x_{min}, x_{max}] = \left[(z_{min}/2)^2,\ (z_{max}/2)^2\right] \tag{22}$$

The confidence limits calculated according to Eq. (21) and Eq. (22) are shown in Table 1 for different values of $k$, together with the probability contents of the calculated intervals. One can see that these intervals overestimate the 68.27% confidence level but converge to it as $k$ increases.
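The entries of Table 1 can be reproduced with a short Python sketch. It builds the interval $\sqrt{4k+2} \mp 1$ in $z$, maps it back with $x = (z/2)^2$, and integrates the Poisson pdf $x^k e^{-x}/k!$ over $[x_{min}, x_{max}]$ using the closed form of the upper tail for integer $k$, $\int_x^\infty t^k e^{-t}/k!\,dt = e^{-x}\sum_{j=0}^{k} x^j/j!$. The function names are hypothetical.

```python
import math

def poisson_upper_tail(k, x):
    """Integral from x to infinity of t^k e^-t / k! dt, for integer k >= 0."""
    term, total = 1.0, 1.0
    for j in range(1, k + 1):
        term *= x / j          # term = x^j / j!
        total += term
    return math.exp(-x) * total

def interval(k):
    """Return (z_min, z_max, x_min, x_max, probability content) for k observed events."""
    z_hat = math.sqrt(4 * k + 2)
    z_min, z_max = z_hat - 1.0, z_hat + 1.0
    x_min, x_max = (z_min / 2.0) ** 2, (z_max / 2.0) ** 2
    content = poisson_upper_tail(k, x_min) - poisson_upper_tail(k, x_max)
    return z_min, z_max, x_min, x_max, content

for k in (0, 1, 2, 5, 10, 100):
    print(k, *(round(v, 6) for v in interval(k)))
```

For $k = 0$ the probability content comes out close to 0.7251 and for $k = 1$ close to 0.6991, in agreement with the tabulated values.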

Let us suppose that one wants to fit a set of data in which the content of each bin obeys a Poisson distribution. If the bins contain small numbers of events, one can no longer use $\sigma_i = \sqrt{n_i}$ as the standard deviation, since the errors are asymmetrical; consequently one cannot use the least-squares fit method, Eq. (1), either, since it works only for Gaussian pdfs. If one insists, one may get large deviations in the estimated parameters, as shown in the figures of the next section. After the above transformation, the content of each bin is approximately Gaussian.

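Using the likelihood ratio test theorem, one gets the following expression in terms of the bin contents $n_i$

$$\chi^2_P = -2\sum_{i=1}^{N}\log(\lambda_i) \tag{23}$$

where

$$\lambda_i = \frac{q_P(z_i)}{q_P(\hat{z}_i)}, \qquad z_i = \sqrt{4f(x_i,\vec{a})+2}, \quad \hat{z}_i = \sqrt{4n_i+2} \tag{24}$$

with $q_P$ taken for $k = n_i$, which gives

$$\chi^2_P = \sum_{i=1}^{N}\left[2\left(f(x_i,\vec{a}) - n_i\right) + (2n_i+1)\log\left(\frac{2n_i+1}{2f(x_i,\vec{a})+1}\right)\right] \tag{25}$$

which asymptotically behaves like a chi-square distribution. This expression is similar to Eq. (2) obtained by Baker and Cousins [3].

One can also derive an equivalent expression for a binomial distribution. This is shown in the appendix.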
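A direct transcription of the new chi-square function, $\sum_i [2(f_i - n_i) + (2n_i+1)\log((2n_i+1)/(2f_i+1))]$, into Python might look as follows. The sketch also evaluates the Poisson likelihood chi-square of Baker and Cousins on the same numbers, to illustrate that the two agree closely when the bin contents are large. The function names and test values are hypothetical.

```python
import math

def chi2_poisson_gauss(n, f):
    """New chi-square: sum[2(f_i - n_i) + (2 n_i + 1) log((2 n_i + 1) / (2 f_i + 1))]."""
    return sum(
        2.0 * (fi - ni) + (2 * ni + 1) * math.log((2 * ni + 1) / (2.0 * fi + 1.0))
        for ni, fi in zip(n, f)
    )

def chi2_baker_cousins(n, f):
    """Poisson likelihood chi-square: 2 * sum[f_i - n_i + n_i log(n_i / f_i)]."""
    total = 0.0
    for ni, fi in zip(n, f):
        total += fi - ni
        if ni > 0:
            total += ni * math.log(ni / fi)
    return 2.0 * total

low = ([2], [5.0])         # a sparsely populated bin
high = ([1000], [1030.0])  # a well populated bin
for n, f in (low, high):
    print(n, f, chi2_poisson_gauss(n, f), chi2_baker_cousins(n, f))
```

Unlike the least-squares form with $\sigma_i^2 = n_i$, this expression needs no special treatment of empty bins, since the argument of the logarithm stays finite and positive for $n_i = 0$.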
4 Comparing the different chi-square functions

Let us now compare the results obtained by minimizing Eq. (1), Eq. (2) and Eq. (23) when the bin contents obey a Poisson distribution. This comparison was made for three different curves: Gaussian, Breit-Wigner and Moyal, Eq. (26), Eq. (27) and Eq. (28), respectively.



$$f_G(x) \propto e^{-\frac{(x-\mu_G)^2}{2\sigma_G^2}} \tag{26}$$



$$f_{BW}(x) \propto \frac{1}{\left(\sigma_{BW}^2 + (x-\mu_{BW})^2\right)} \tag{27}$$

$$f_M(x) \propto e^{-h-e^{-h}} \tag{28}$$

where $h = (x-\mu_M)/\sigma_M$.



The first two are symmetrical curves with "short" and "long" tails, respectively, while the last one is an asymmetrical function with a "short" left tail and a "long" right tail.

The parameters $\{\mu_j\}$ are associated with the maximum value of the distribution and $\{\sigma_j\}$ are related to the spread of the distribution, where $j$ = G, BW, M.

Random points are generated with known $\mu_j$ and $\sigma_j$ according to each of the above distributions and used to fill histograms with 100 bins, see Table 2.

The number of entries ranges from 20 to 10^ and for each fixed number of 
entries, 10^ sets of points were generated. 

For each fixed number of entries we calculate the average value of the fitted parameters $\hat{\mu}_j$ and $\hat{\sigma}_j$, which are the estimators of $\mu_j$ and $\sigma_j$, respectively, using Eq. (1), Eq. (2) and Eq. (25) and the MERLIN optimization package.

The fit was done from the first bin to the last bin with content different from zero, although there could be bins with zero content in between, except for Eq. (1), where the bins with zero content were excluded in order to avoid singularities. One also calculates the expected mean errors, defined as $\Delta\sigma_j = \sqrt{\langle\hat{\sigma}_j - \sigma_j\rangle^2}$ and $\Delta\mu_j = \sqrt{\langle\hat{\mu}_j - \mu_j\rangle^2}$, of the fitted $\hat{\mu}_j$ and $\hat{\sigma}_j$ with respect to the "true" known $\mu_j$ and $\sigma_j$, for a fixed number of entries.
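The procedure can be illustrated with a toy Monte Carlo in Python. The sketch below is not the study reported here: it uses only the Gaussian case of Table 2, fits a single parameter ($\sigma_G$) by a grid scan instead of the MERLIN package, fixes $\mu_G$ and the normalization to their true values, fits all 100 bins, and quotes a root-mean-square deviation from the true value as a simple error measure. The number of entries (200), the number of toy histograms (50), the seed and all function names are assumptions chosen to keep the example fast.

```python
import math
import random

def chi2_p(n, f):
    """New chi-square: sum[2(f_i - n_i) + (2 n_i + 1) log((2 n_i + 1) / (2 f_i + 1))]."""
    return sum(2.0 * (fi - ni) + (2 * ni + 1) * math.log((2 * ni + 1) / (2.0 * fi + 1.0))
               for ni, fi in zip(n, f))

def gauss_model(centers, width, entries, mu, sigma):
    """Expected bin contents: entries * bin width * Gaussian pdf at the bin centre."""
    norm = entries * width / (sigma * math.sqrt(2.0 * math.pi))
    return [norm * math.exp(-0.5 * ((x - mu) / sigma) ** 2) for x in centers]

def toy_fit_sigma(entries, rng, bins=100, lo=-5.0, hi=5.0):
    """Generate one histogram with mu = 0, sigma = 1 and fit sigma by a grid scan."""
    width = (hi - lo) / bins
    centers = [lo + (i + 0.5) * width for i in range(bins)]
    counts = [0] * bins
    for _ in range(entries):
        x = rng.gauss(0.0, 1.0)
        if lo <= x < hi:
            counts[min(int((x - lo) / width), bins - 1)] += 1
    grid = [0.5 + 0.01 * j for j in range(101)]  # sigma candidates from 0.5 to 1.5
    return min(grid, key=lambda s: chi2_p(counts, gauss_model(centers, width, entries, 0.0, s)))

rng = random.Random(12345)
fits = [toy_fit_sigma(200, rng) for _ in range(50)]
mean_sigma = sum(fits) / len(fits)
rms_error = math.sqrt(sum((s - 1.0) ** 2 for s in fits) / len(fits))
print(mean_sigma, rms_error)
```

Repeating the last four lines for several numbers of entries gives the kind of curves shown in the figures, with the caveats listed above.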

These results are summarized in Figs. 2-13 in terms of the number of entries. We can see clearly that the minimization of $\chi^2_P$ shows faster or equal convergence to the "true" value and smaller or equal expected mean errors, as the number of entries increases, than the minimization of $\chi^2_{BC}$. Both these functions are systematically much better than $\chi^2_G$ for the convergence and the expected mean errors of the parameters. For a small number of entries, we can also notice that the parameters $\sigma_j$ are systematically overestimated in all the cases shown, while $\mu_M$ for the Moyal case is underestimated. All the chi-square functions presented here converge systematically to the correct value as the number of entries increases, and one can observe in all figures that the three methods coincide in that limit. We can clearly see the advantage of using the chi-square function introduced here: it converges equally well or faster and has equal or smaller expected mean errors than the other two methods. Besides, Eq. (25) is also easy to implement in optimization programs.

5 Conclusions 

This article presented an improvement over the usual chi-square minimization technique for fitting functions, in order to extend its applicability to low-statistics data with asymmetrical errors. The new method is obtained through a change of variables that transforms the original pdf into an approximately Gaussian pdf. The approximately Gaussian pdf associated with a Poisson pdf is obtained, and a chi-square function is adapted to this new Gaussian-like pdf. Monte Carlo generated events show an improvement in the low-statistics region, as expected, while the results of all methods converge to the "true" value as the number of events increases. This method has shown a fast convergence to the correct parameter values, especially for the parameter associated with the curve spread, $\sigma_j$. The proposed chi-square function is consistent, since the fitted parameters converge to the true values of the parameters and their expected mean errors decrease as the number of observations increases. The results are simple and easy to use in standard optimization procedures. A similar result is obtained in the appendix for binomial distributions.

Acknowledgments: The authors are very grateful for discussions with A. Ramalho and A. Vaidya. This work was supported in part by the following Brazilian agencies: CNPq, FINEP, FAPERJ, FJUB and CAPES.
A Chi-square Function for Binomial Distribution

Let us now derive a $\chi^2_B$ for the case when each bin content obeys a binomial distribution; this is the usual case when dealing with uncorrelated normalized data. The binomial distribution $p_B(x)$, normalized with respect to $x$, can be written as

$$p_B(x) = \frac{\Gamma(n+2)\,x^{k}(1-x)^{n-k}}{\Gamma(k+1)\Gamma(n-k+1)} \tag{29}$$

where $x \in [0, 1]$, then

$$J(x) = -\left[\frac{d^2\log(p_B(x))}{dx^2}\right] = \frac{n}{x(1-x)} \tag{30}$$

Supposing again that $z(x)$ is a one-to-one transformation, then

$$J(z) = -\left[\frac{d^2\log(p_B(x))}{dx^2}\right]\left(\frac{dx}{dz}\right)^2 \tag{31}$$

Let us choose $dx/dz$ such that

$$\frac{dx}{dz} = \sqrt{x(1-x)} \tag{32}$$

then inverting the above equation one gets

$$\frac{dz}{dx} = \frac{1}{\sqrt{x(1-x)}} \tag{33}$$

Integrating the latter expression

$$z = \int_{1/2}^{x}\frac{dt}{\sqrt{t(1-t)}} = \arcsin(2x-1) \tag{34}$$

Then the approximately Gaussian pdf associated with the binomial pdf above is

$$q_B(z) = \frac{\Gamma(n+2)}{2^{n+1}\Gamma(k+1)\Gamma(n-k+1)}\,(1+\sin(z))^{k}(1-\sin(z))^{n-k}\cos(z) \tag{35}$$
where $z \in [-\pi/2, \pi/2]$. This function has a variance $\approx 1/(n+1)$, which is independent of $k$, and its maximum is located at

$$\hat{z} = \arcsin\left(\frac{2k-n}{n+1}\right) = \arcsin\left(\frac{2k/n-1}{1+1/n}\right) \tag{36}$$

The approximate 68.27% confidence limits are given by the analytical expression

$$\arcsin\left(\frac{2k/n-1}{1+1/n}\right) \mp \frac{1}{\sqrt{n+1}} \tag{37}$$

and after using the likelihood ratio test theorem, one obtains the chi-square function for a binomial pdf

\begin{align}
\chi^2_B = \sum_{i=1}^{N}\Big[ & (2n_i+1)\log\left(\frac{2n_i+1}{2nf(x_i,\vec{a})+1}\right) \tag{38} \\
 & + \left(2(n-n_i)+1\right)\log\left(\frac{2(n-n_i)+1}{2n\left(1-f(x_i,\vec{a})\right)+1}\right)\Big] \tag{39}
\end{align}

where $0 < f(x_i,\vec{a}) < 1$ are the estimated parameters.
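The binomial chi-square function, $\sum_i [(2n_i+1)\log((2n_i+1)/(2nf_i+1)) + (2(n-n_i)+1)\log((2(n-n_i)+1)/(2n(1-f_i)+1))]$, can be coded in the same way as the Poisson one. In the Python sketch below, `n_trials` stands for $n$ and is assumed to be the same for every bin, and `f` holds the model probabilities $0 < f_i < 1$; these names and the sample values are hypothetical.

```python
import math

def chi2_binomial(n_obs, f, n_trials):
    """Binomial chi-square for observed counts n_obs and model probabilities f."""
    total = 0.0
    for ni, fi in zip(n_obs, f):
        total += (2 * ni + 1) * math.log((2 * ni + 1) / (2.0 * n_trials * fi + 1.0))
        total += (2 * (n_trials - ni) + 1) * math.log(
            (2 * (n_trials - ni) + 1) / (2.0 * n_trials * (1.0 - fi) + 1.0)
        )
    return total

# The function vanishes when the model reproduces the observed fractions n_i / n ...
print(chi2_binomial([3, 7], [0.3, 0.7], 10))
# ... and is positive otherwise.
print(chi2_binomial([3], [0.5], 10))
```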
Figure captions

Fig. 1 The approximate Gaussian probability density function (pdf) associated with a Poisson pdf for $k$ = 0, 1, 2, 5 and 10, and the exact Gaussian distribution with variance equal to one.

Fig. 2 The average value of the fitted Gaussian parameter $\mu_G$ as a function of the number of entries. The curves P, BC and LSM correspond to the chi-square functions $\chi^2_P$, $\chi^2_{BC}$ and $\chi^2_G$, respectively.

Fig. 3 The expected mean error $\Delta\mu_G$ of the fitted parameter $\mu_G$ as a function of the number of entries. The curves P, BC and LSM correspond to the chi-square functions $\chi^2_P$, $\chi^2_{BC}$ and $\chi^2_G$, respectively.

Fig. 4 Same as Fig. 2 but for the parameter $\sigma_G$.

Fig. 5 Same as Fig. 3 but for the parameter $\sigma_G$.

Fig. 6 The average value of the fitted Breit-Wigner parameter $\mu_{BW}$ as a function of the number of entries. The curves P, BC and LSM correspond to the chi-square functions $\chi^2_P$, $\chi^2_{BC}$ and $\chi^2_G$, respectively.

Fig. 7 The expected mean error $\Delta\mu_{BW}$ of the fitted parameter $\mu_{BW}$ as a function of the number of entries. The curves P, BC and LSM correspond to the chi-square functions $\chi^2_P$, $\chi^2_{BC}$ and $\chi^2_G$, respectively.

Fig. 8 Same as Fig. 6 but for the parameter $\sigma_{BW}$.

Fig. 9 Same as Fig. 7 but for the parameter $\sigma_{BW}$.

Fig. 10 The average value of the fitted Moyal parameter $\mu_M$ as a function of the number of entries. The curves P, BC and LSM correspond to the chi-square functions $\chi^2_P$, $\chi^2_{BC}$ and $\chi^2_G$, respectively.

Fig. 11 The expected mean error $\Delta\mu_M$ of the fitted parameter $\mu_M$ as a function of the number of entries. The curves P, BC and LSM correspond to the chi-square functions $\chi^2_P$, $\chi^2_{BC}$ and $\chi^2_G$, respectively.

Fig. 12 Same as Fig. 10 but for the parameter $\sigma_M$.

Fig. 13 Same as Fig. 11 but for the parameter $\sigma_M$.
Columns: z_min = sqrt(4k+2) - 1, z_max = sqrt(4k+2) + 1, x_min = (z_min/2)^2, x_max = (z_max/2)^2, and P = integral of p_P(x) dx from x_min to x_max.

  k     z_min          z_max          x_min            x_max          P
  0     0.414213562    2.414213562    0.04289321873    1.457106781    0.7251045244
  1     1.449489743    3.449489743    0.5252551288     2.974744873    0.6990903884
  2     2.162277660    4.162277660    1.168861170      4.331138830    0.6926877287
  3     2.741657387    4.741657387    1.879171307      5.620828695    0.6898630257
  4     3.242640687    5.242640687    2.628679658      6.871320343    0.6882789039
  5     3.690415760    5.690415760    3.404792120      8.095207880    0.6872666311
  6     4.099019514    6.099019514    4.200490245      9.299509758    0.6865643067
  7     4.477225575    6.477225575    5.011387213      10.48861279    0.6860486195
  8     4.830951895    6.830951895    5.834524053      11.66547595    0.6856539615
  9     5.164414003    7.164414003    6.667792998      12.83220700    0.6853422256
  10    5.480740698    7.480740698    7.509629650      13.99037035    0.6850897795
  20    8.055385138    10.05538511    16.22230713      25.27769258    0.6839191522
  30    10.04536102    12.04536102    25.22731950      36.27268053    0.6835159909
  40    11.72792206    13.72792206    34.38603895      47.11396103    0.6833119043
  50    13.21267040    15.21267040    43.64366478      57.85633518    0.6831886429
  60    14.55634919    16.55634919    52.97182543      68.52817463    0.6831061362
  70    15.79285562    17.79285562    62.35357215      79.14642778    0.6830470354
  80    16.94435844    18.94435844    71.77782073      89.72217918    0.6830026194
  90    18.02629759    20.02629759    81.23685120      100.2631488    0.6829680213
  100   19.04993766    21.04993766    90.72503123      110.7749689    0.6829403087

Table 1: Approximate 68.3% confidence limits, [z_min, z_max] and [x_min, x_max], for the almost Gaussian and Poisson distributions, respectively, for different k values.
  Distribution    mu_j    sigma_j    Range
  Gauss           0.0     1.0        [-5.0, 5.0]
  Breit-Wigner    50.0    2.0        [0.0, 100.0]
  Moyal           2.0     0.5        [0.0, 8.0]

Table 2: Distributions, their parameters and the range divided in 100 bins according to which the random numbers were generated.
[Figure plots (see the figure captions); horizontal axis: Number of Entries, logarithmic scale.]